40 research outputs found
Factor Analysis for Spectral Estimation
Power spectrum estimation is an important tool in many applications, such as
the whitening of noise. The popular multitaper method enjoys significant
success, but fails for short signals with few samples. We propose a statistical
model where a signal is given by a random linear combination of fixed, yet
unknown, stochastic sources. Given multiple such signals, we estimate the
subspace spanned by the power spectra of these fixed sources. Projecting
individual power spectrum estimates onto this subspace increases estimation
accuracy. We provide accuracy guarantees for this method and demonstrate it on
simulated and experimental data from cryo-electron microscopy.Comment: 5 pages, 3 figures; 12th International Conference Sampling Theory and
Applications, July 3-7, 2017, Tallinn, Estoni
Structural Variability from Noisy Tomographic Projections
In cryo-electron microscopy, the 3D electric potentials of an ensemble of
molecules are projected along arbitrary viewing directions to yield noisy 2D
images. The volume maps representing these potentials typically exhibit a great
deal of structural variability, which is described by their 3D covariance
matrix. Typically, this covariance matrix is approximately low-rank and can be
used to cluster the volumes or estimate the intrinsic geometry of the
conformation space. We formulate the estimation of this covariance matrix as a
linear inverse problem, yielding a consistent least-squares estimator. For
images of size -by- pixels, we propose an algorithm for calculating this
covariance estimator with computational complexity
, where the condition number
is empirically in the range --. Its efficiency relies on the
observation that the normal equations are equivalent to a deconvolution problem
in 6D. This is then solved by the conjugate gradient method with an appropriate
circulant preconditioner. The result is the first computationally efficient
algorithm for consistent estimation of 3D covariance from noisy projections. It
also compares favorably in runtime with respect to previously proposed
non-consistent estimators. Motivated by the recent success of eigenvalue
shrinkage procedures for high-dimensional covariance matrices, we introduce a
shrinkage procedure that improves accuracy at lower signal-to-noise ratios. We
evaluate our methods on simulated datasets and achieve classification results
comparable to state-of-the-art methods in shorter running time. We also present
results on clustering volumes in an experimental dataset, illustrating the
power of the proposed algorithm for practical determination of structural
variability.Comment: 52 pages, 11 figure
Multitaper estimation on arbitrary domains
Multitaper estimators have enjoyed significant success in estimating spectral
densities from finite samples using as tapers Slepian functions defined on the
acquisition domain. Unfortunately, the numerical calculation of these Slepian
tapers is only tractable for certain symmetric domains, such as rectangles or
disks. In addition, no performance bounds are currently available for the mean
squared error of the spectral density estimate. This situation is inadequate
for applications such as cryo-electron microscopy, where noise models must be
estimated from irregular domains with small sample sizes. We show that the
multitaper estimator only depends on the linear space spanned by the tapers. As
a result, Slepian tapers may be replaced by proxy tapers spanning the same
subspace (validating the common practice of using partially converged solutions
to the Slepian eigenproblem as tapers). These proxies may consequently be
calculated using standard numerical algorithms for block diagonalization. We
also prove a set of performance bounds for multitaper estimators on arbitrary
domains. The method is demonstrated on synthetic and experimental datasets from
cryo-electron microscopy, where it reduces mean squared error by a factor of
two or more compared to traditional methods.Comment: 28 pages, 11 figure
Covariance estimation using conjugate gradient for 3D classification in Cryo-EM
Classifying structural variability in noisy projections of biological
macromolecules is a central problem in Cryo-EM. In this work, we build on a
previous method for estimating the covariance matrix of the three-dimensional
structure present in the molecules being imaged. Our proposed method allows for
incorporation of contrast transfer function and non-uniform distribution of
viewing angles, making it more suitable for real-world data. We evaluate its
performance on a synthetic dataset and an experimental dataset obtained by
imaging a 70S ribosome complex
Extended playing techniques: The next milestone in musical instrument recognition
The expressive variability in producing a musical note conveys information
essential to the modeling of orchestration and style. As such, it plays a
crucial role in computer-assisted browsing of massive digital music corpora.
Yet, although the automatic recognition of a musical instrument from the
recording of a single "ordinary" note is considered a solved problem, automatic
identification of instrumental playing technique (IPT) remains largely
underdeveloped. We benchmark machine listening systems for query-by-example
browsing among 143 extended IPTs for 16 instruments, amounting to 469 triplets
of instrument, mute, and technique. We identify and discuss three necessary
conditions for significantly outperforming the traditional mel-frequency
cepstral coefficient (MFCC) baseline: the addition of second-order scattering
coefficients to account for amplitude modulation, the incorporation of
long-range temporal dependencies, and metric learning using large-margin
nearest neighbors (LMNN) to reduce intra-class variability. Evaluating on the
Studio On Line (SOL) dataset, we obtain a precision at rank 5 of 99.7% for
instrument recognition (baseline at 89.0%) and of 61.0% for IPT recognition
(baseline at 44.5%). We interpret this gain through a qualitative assessment of
practical usability and visualization using nonlinear dimensionality reduction.Comment: 10 pages, 9 figures. The source code to reproduce the experiments of
this paper is made available at:
https://www.github.com/mathieulagrange/dlfm201